Spread Lips + Raised Larynx + Higher F0 = Smiled Speech? – An Articulatory Synthesis Approach
نویسندگان
چکیده
We present an initial study on how to model smiled speech with an articulatory speech synthesizer, led by the research question as to what cues are responsible for the effect of an audible distinction of smiled vs. non-smiled speech. In a perception test, we explore the relative contributions of i) spreading of the lips, ii) raising of the larynx, and iii) raising of the fundamental frequency. 36 test subjects assessed isolated synthetic vowel stimuli of /a:, i:, y:, u:/ on a 5-point “smiley scale”. Results indicate that F0 is the main acoustic factor for perceiving smileyness. The other factors depend on the vowel quality, with best results for the unrounded vowels /i:/ and /a:/.
منابع مشابه
Vocal pitch discrimination in the motor system.
Speech production can be broadly separated into two distinct components: Phonation and Articulation. These two aspects require the efficient control of several phono-articulatory effectors. Speech is indeed generated by the vibration of the vocal-folds in the larynx (F0) followed by ''filtering" by articulators, to select certain resonant frequencies out of that wave (F1, F2, F3, etc.). Recentl...
متن کاملArticulatory synthesis from x-rays and inversion for an adaptive speech robot
This paper describes a speech robotic approach to articulatory synthesis. An anthropomorphic speech robot has been built, based on a real reference subject’s data. This speech robot, called the Articulotron, has a set of relevant degrees of freedom for speech articulators, jaw, tongue, lips, and larynx. The associated articulatory model has been elaborated from cineradiographic midsagittal prof...
متن کاملAcoustics vs. articulation in articulatory speech synthesis: One vocal tract target configuration has more than one sound
for ESSV 2010 (by Eva Lasarcyk): Acoustics vs. articulation in articulatory speech synthesis: One vocal tract target configuration has more than one sound. The goal of this contribution is to illustrate the importance of the acoustic settings of articulatory speech synthesis when using it for perception/validation experiments regarding the relationship between articulation and fine phonetic det...
متن کاملThe Sound of Deception - What Makes a Speaker Credible?
The detection of deception in human speech is a difficult task but can be performed above chance level by human listeners even when only audio data is provided. Still, it is highly contested, which speech features could be used to help identify lies. In this study, we examined a set of phonetic and paralinguistic cues and their influence on the credibility of speech using an analysis-by-synthes...
متن کاملAcoustic to articulatory inversion
The context of this work is speech analysis. The subject deals with acoustic-to-articulatory inversion, i.e. the recovery of the temporal evolution of the vocal tract shape from the signal. This topic is important because it is likely to give rise to applications in the domains of speech coding as well as second language learning. Acoustic-to-articulatory inversion relies on an analysis by synt...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008